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Abstract 

We consider directed graphs over a set of n agents, where an edge (/, jf) is taken to mean that agent i 
supports or trusts agent j. Given such a graph and an integer k < n, we wish to select a subset of k agents 
that maximizes the sum of indegrees, i.e., a subset of k most popular or most trusted agents. At the same 
time we assume that each individual agent is only interested in being selected, and may misreport its 
outgoing edges to this end. This problem formulation captures realistic scenarios where agents choose 
among themselves, which can be found in the context of Internet search, social networks like Twitter, or 
reputation systems like Epinions. 

Our goal is to design mechanisms without payments that map each graph to a fc-subset of agents 
to be selected and satisfy the following two constraints: strategyproofness, i.e., agents cannot benefit 
from misreporting their outgoing edges, and approximate optimality, i.e., the sum of indegrees of the 
selected subset of agents is always close to optimal. Our first main result is a surprising impossibility: 
for k G {1, . . . ,n — 1}, no deterministic strategyproof mechanism can provide a finite approximation 
ratio. Our second main result is a randomized strategyproof mechanism with an approximation ratio that 
is bounded from above by four for any value of k, and approaches one as k grows. 
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1 Introduction 



One of the most well-studied settings in social choice theory concerns a set of agents (also known as voters 
or individuals) and a set of alternatives (also known as candidates). The agents express their preferences 
over the alternatives, and these are mapped by some function to a winning alternative or set of winning 
alternatives. In one prominent variation, each agent must select a subset of alternatives that it approves; this 
setting is known as Approval voting |6j. 

We consider the special case of Approval voting when the set of agents and the set of alternatives 
coincide. Specifically, in our model there is an underlying directed graph, with the agents as vertices. An 
edge from agent i to agent j implies that agent i approves, votes for, trusts, or supports agent j. Our goal 
is to select a subset of k "best" agents, based on the given graph; we elaborate on what we mean by "best" 
momentarily. 

Our assumption that agents and alternatives coincide enables us to restrict the agents' preferences. In- 
deed, we assume that each agent is only interested in whether it is among those selected, that is, it receives 
utility one if selected and zero otherwise. This assumption reflects, in the limit, a situation where each 
agent gives very small weight to the overall composition of the selected subset, and very high weight to the 
question of its own selection^ 

As a first motivating example, consider an Internet search setting. The web sites are the agents, while 
hyperlinks are represented by edges. Given this graph, a search engine must return a set of the, say, ten top 
web sites. Put another way, the top web sites are selected based on the votes cast by other web sites in the 
form of hyperlinks. Each specific web site or, more accurately, its webmaster is naturally concerned only 
with appearing at the top of the search results, and to this end may add or remove hyperlinks at will. 

A (deterministic) k-selection mechanism is a function that maps a given graph on the set of agents to a 
^-subset of selected agents. We also consider randomized /c-selection mechanisms, which randomly select a 
subset. 

The outgoing edges in the underlying graph G are private information of the respective agent. Fixing a 
mechanism /, the agents play the following game. Each of them reports to the mechanism a set of outgoing 
edges, which might differ from the true set. The reported edges induce a graph G', and the mechanism selects 
the subset f(G'). We say that a mechanism is strategyproof (SP) if an agent cannot benefit from misreporting 
its outgoing edges, that is, cannot increase its chances of being selected, even if it has complete information 
about the rest of the graph. We further say that a mechanism is group strategyproof (GSP) if even a coalition 
of agents cannot all gain from misreporting their outgoing edges. 

We now explain what we mean by selecting the "best" agents. In this paper, we measure the quality of 
a set of agents by their total number of incoming edges, i.e., the sum of their indegrees. The goal of the 
mechanism designer is to optimize this target function. Note that this goal is in a sense orthogonal to the 
agent's interests, which may make the design of good SP mechanisms difficult. 

A second motivating example can be found in the context of social networks. While some social net- 
works, like Facebook ( http : / / f acebook . corn) , correspond to undirected graphs, there are many exam- 
ples with unilateral connections. Each user of the reputation system Epinions ( [http : //epinions . com] ) 
has a "Web of Trust", that is, the user unilaterally chooses which other users to trust. Another prominent ex- 
ample is the social network Twitter ( http : //twitter . com] ), which of late has become wildly popular; 
a Twitter user may choose which other users to "follow." 

In "directed" social networks, choosing a fc-subset with maximum overall indegree simply means select- 
ing the k most popular or most trusted users. Applications include setting up a committee, recommending 

'See Section[5]for further discussion of this utility model. 
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a trusted group of vendors, targeting a group for an advertising campaign, or simply holding a popularity 
contest. The last point may seem pure fantasy, but, indeed, celebrity users of Twitter have recently held a 
race to the milestone of one million followers; the dubious honor ultimately went to actor Ashton Kutcher. 
Clearly Mr. Kutcher could increase the chance of being selected by not following any other users, that is, 
reporting an empty set of outgoing edges. 

Since a mechanism that selects an optimal subset (in terms of total indegree) is clearly not SP, we will 
resort to approximate optimality. More precisely, we seek SP mechanisms that give a good approximation, in 
the usual sense, to the total indegree. Crucially, approximation is not employed in this context to circumvent 
computational complexity (as the problem of selecting an optimal subset is obviously tractable), but in order 
to sufficiently broaden the space of acceptable mechanisms to include SP ones. 

Context and related work. The work in this paper falls squarely into the realm of approximate mecha- 
nism design without money, an agenda recently introduced by some of us (Procaccia and Tennenholtz [22]), 
building on earlier work (for example by Dekel et al. iTTOl ). This agenda advocates the design of SP approxi- 
mation mechanisms without payments for structured, and preferably computationally tractable, optimization 
problems. Indeed, while almost all the work in the field of algorithmic mechanism design [21] considers 
mechanisms that are allowed to transfer payments to and from the agents, money is usually unavailable in 
Internet domains like the ones discussed above (social networks, search engines) due to security and ac- 
countability issues (see, e.g., the book chapter by Schummer and Vohra |[24l ). Our notion of a mechanism, 
sometimes referred to as a social choice rule in the social choice literature, therefore precludes payments 
by definition. Note that Procaccia and Tennenholtz [22], and also Alon et al. [1], deal with a completely 
different domain, namely facility location. 

For k = 1 , that is, if one agent must be selected, the game we deal with is a special case of so-called 
selection games H, where the possible strategies are the outgoing edges. More generally, this setting is 
related to work in distributed computing on leader election (see, e.g., E El |5l). This line of work 
does not deal with self-interested agents. Instead, there is a certain number of malicious agents trying to 
manipulate the selection process, and the goal is to guarantee the selection of a non-malicious agent, at least 
with a certain probability. 

Finally, this paper is related to work on manipulation of reputation systems, which are often modeled as 
weighted directed graphs; a reputation function maps a given graph to reputation values for the agents (see, 
e.g., GJQ3]]). Although our positive results can be extended to weighted graphs, when the target function 
is the sum of weights on incoming edges, this would hardly be a reasonable target function. Indeed, in this 
context the absence of a specific incoming edge (which indicates lack of knowledge) is preferable to an edge 
with low weight (which indicates distrust); see Section[5]for further discussion. 

Our results and techniques. We give rather tight upper and lower bounds on the approximation ratio 
achievable by ^-selection mechanisms in the setting described above; the properties of the mechanisms fall 
along two orthogonal dimensions: deterministic vs. randomized, and SP vs. GSP A summary of our results 
is given in Table Q] 

Our contribution begins in Section [3] with a study of deterministic /c-selection mechanisms. It is quite 
easy to see that no deterministic SP 1 -selection mechanism can yield a finite approximation ratio. Intu- 
itively, this should not be true for large values of k. Indeed, in order to have a finite approximation ratio, a 
mechanism should very simply select a subset of agents with at least one incoming edge, if there is such a 
set. In the extreme case when k = n — 1, we must select all the agents save one, and the question is whether 
there exists an SP mechanism that never eliminates the unique agent with positive indegree. Our first result 
gives a surprising negative answer to this question, and in fact holds for every value of k. 
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Table 1: Summary of our results for ^-selection mechanisms, where n is the number of agents. SP stands for strate- 
gyproof, GSP for group strategyproof. 

Theorem |3.1[ Let N = { 1 , . . . , n}, n>2, and k € {1 , • ■ • ,n — 1}. Then there is no deterministic SP k-selection 
mechanism that gives a finite approximation ratio. 

The proof of the theorem is compact but rather tricky. It involves two main arguments. We first restrict 
our attention to a subset of the graphs, namely to stars with all edges directed at a specific agent. An SP 
mechanism over such graphs can be represented using a function over the boolean (n — l)-cube, which must 
satisfy certain constraints. We then use a parity argument to show that the constraints lead to a contradiction. 

In Section [4] we turn to randomized ^-selection mechanisms. We design a randomized mechanism, 
Random m-Partition (m-RP), parameterized by m, that works by randomly partitioning the set of agents into 
m subsets, and then selecting the (roughly) k/m agents with largest indegree from each subset, when only the 
incoming edges from the other subsets are taken into account. This rather simple technique is reminiscent 
of work on random sampling in the context of auctions for digital goods lfl4l [T8l [T3l and combinatorial 
auctions ifTTTl . although our problem is fundamentally different. We have the following theorem. 

Theorem @TD LetN = {l,...,n}, k € {1, . . . ,n- 1}. For every value of m, m-RP is SP. Furthermore, 
1. 2-RP has an approximation ratio of four, and 
1. ( 3 ] ) 'has an approximation ratio of \ + 0(1/ k 1 /^). 

For a given number k of agents to be selected, we can in fact choose the best value of m when applying 
m-RP. Thus, there exists a mechanism that always yields an approximation ratio of at most four, and further- 
more provides a ratio that approaches one as k grows. In addition, we prove a lower bound of 1 + Q.(\/k 2 ) 
on the approximation ratio that can be achieved by any randomized SP ^-selection mechanism; in particular, 
the lower bound is two for k = 1. 

As our final result, we obtain a lower bound of (n — l )/k for randomized GSP ^-selection mechanisms. 
This result implies that when asking for group strategyproofness one essentially cannot do better than simply 
selecting k agents at random, which is obviously GSP and gives an approximation ratio oin/k. 

2 The Model 

Let N = { 1 ,...,«} be a set of agents. For each k = 1, . . . ,n, let =5^ = =5^(n) be the collection of ^-subsets of 
N, i.e., = {S C N : \S\ = k}. We consider directed graphs G = (N,E), that is, graphs with Af as the set 
of vertices, and write W = 9?(N) for the set of such graphs. 

A deterministic k-selection mechanism is a function / : ^ — » ^ that selects a subset of agents for each 
graph. When the subset S C ,/V is selected, agent i 6 N obtains utility = 1 if i G 5 and Ui(S) = 
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otherwise, i.e., agents only care about whether they are selected or not. We further discuss this utility model 
in Section [3] 

A randomized k-selection mechanism is a function / : — > A(S p k), where A(J^) is the set of probability 
distributions over J^. Given a distribution /X G A( t 5^ : ), the utility of agent i EN is 

u i (n)=E s ^[u i (S))='Pr s ^{ieS}. 

Deterministic mechanisms can be seen as a special case of a randomized ones, always selecting a set of 
agents with probability one. 

We say that a ^-selection mechanism is strategyproof (SP) if an agent cannot benefit from misreporting 
its edges. Formally, strategyproofness requires that for every i G N and every pair of graphs G,G' £ ^ that 
differ only in the outgoing edges of agent i, it holds that w,-(G) = m,-(G')H This means that the probability of 
agent i G N being selected has to be independent of the outgoing edges reported by i. A discussion of this 
definition in the context of randomized mechanisms can be found in Section [5] 

A ^-selection mechanism is group strategyproof (GSP) if there is no coalition of agents that can all gain 
from jointly misreporting their outgoing edges. Formally, group strategyproofness requires that for every 
5 C N and every pair of graphs G,G' £ <S that differ only in the outgoing edges of the agents in S, there 
exists i £ S such that m,(G) < Ui(G'). An alternative, stronger definition requires that some agent strictly lose 
as a result of the deviation. Crucially, our result with respect to group strategyproofness is an impossibility, 
hence using the weaker definition only strengthens the result. 

Given a graph G, let deg(/) = deg(/,G) be the indegree of agent i in G, i.e., the number of its incom- 
ing edges. We seek mechanisms that are SP or GSP, and in addition approximate the optimization target 
Liesdeg(i), that is, we wish to maximize the sum of indegrees of the selected agents. Formally, we say that 
a ^-selection mechanism / has an approximation ratio of a if for every graph G, 

max 5£J4 Ei6sdeg(/) < 
E S~/(G)EieS deg(01 - 

3 Deterministic Mechanisms 

In this section we study deterministic /c-selection mechanisms. Before stating our impossibility result, we 
discuss some special cases. 

Clearly, only one mechanism exists for k = n, that is, when all the agents must be selected, and this 
mechanism is optimal. More interestingly, it is easy to see that one cannot obtain a finite approximation ratio 
via a deterministic SP mechanism when k = 1. Indeed, let n > 2, let / be an SP deterministic mechanism, and 
consider a graph G = (N,E) with E = {(1,2), (2, 1)}, i.e., the only two edges are from agent 1 to agent 2 and 
vice versa. Without loss of generality we may assume that f(G) = { 1}. Now, assume that agent 2 removes its 
outgoing edge; formally, we now consider the graph G' = (N,E') with E' = {(1,2)}. By strategyproofness, 
f(G') = {1}, but now agent 2 is the only agent with positive degree, hence the approximation ratio of / is 
infinite. 

Note that in order to have a finite approximation ratio, our mechanism must satisfy the following prop- 
erty, which is also sufficient: if there is an edge in the graph, the mechanism must select a subset of agents 
with at least one incoming edge. The argument above shows that this property cannot be satisfied by any SP 
mechanism when k = 1, but intuitively it should be easy to satisfy when k is very large. 

2 By symmetry, this is equivalent to writing the last equality as an inequality. 
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Consider, for example, the case where k = n — 1, that is, the mechanism must select all the agents save 
one. Can we design an SP mechanism with the extremely basic property that if there is only one agent with 
incoming edges, that agent would not be the only one not to be selected? 

In the following theorem, we give a surprising negative answer to this question, even when we restrict our 
attention to graphs where each agent has at most one outgoing edge. Amusingly, a connection to the popular 
TV game show "Survivor" can be made. Consider a slight variation where each tribe member can vote for 
one other trusted member, but is also allowed not to cast a vote. One member must be eliminated at the tribal 
council, based on the votes. Since each member's first priority is not to be eliminated (i.e., to be selected), 
strategyproofness in our 0-1 utility model is in fact a necessary condition for strategyproofness in suitable, 
more refined utility models. The theorem then implies that a mechanism for choosing the eliminated member 
cannot be SP (even under 0-1 utilities) if it has the property that a member who is the only one that received 
votes cannot be eliminated. Put another way, lies are inherent in the game! 

More generally, we show that for any value of k, strategyproofness and finite approximation ratio are 
mutually exclusive. The proof is concise but nontrivial. 

Theorem 3.1. Let N = {1, . . . ,n}, n>2, and k G {1, ... ,n — 1}. There is no deterministic SP k-selection 
mechanism that gives a finite approximation ratio. 

Proof. Assume for contradiction that / : & — > S^k is a deterministic SP ^-selection mechanism that gives a 
finite approximation ratio. Furthermore, let G* = (N,0) be the empty graph. Since k <n, there exists i G N 
such that i f(G*); without loss of generality, n ^ f(G*). 

We will restrict our attention to stars whose center is agent n, that is, graphs where the only edges are of 
the form (i,n) for an agent i G N\ {«}. We can represent such a graph by a binary vector x= (jci , . .. 
where x, = 1 if and only if the edge (i, 1) is in the graph; see Figure Q]for an illustration. In other words, we 
restrict the domain of / to {0, 1}" _1 . 

We claim that n G fix) for all x 6 {0, \ {0}. Indeed, in every such graph agent n is the only 
agent with incoming edges. Hence, any subset that does not include agent n has zero incoming edges, and 
therefore does not give a finite approximation ratio (as a subset that does include agent n has at least one 
incoming edge). 

To summarize, / satisfies the following three constraints: 

1. «£/(0). 

2. For all x G {0, l}"" 1 \ {0}, n G /(x). 

3. Strategyproofness: for all i G N\ {n} and x G {0, 1}"~\ i G f(x) if and only if i G /(x + <?,), where <*,• 
is the ith unit vector and addition is modulo 2. 

Next, we claim that |{x G {0, 1}" _1 : i G /(x)}| is even for all i G N\ {«}. This follows directly from the 
third constraint, strategyproofness: we can simply partition the set {x G {0, 1}" _1 : i G /(x)} into disjoint 
pairs of the form {x, x + ef\. 

Finally, we consider the expression I^xefo.i}"- 1 l/( x )l- On one hand, we have that 



£ |/(x)| = ^|{xG;{0,ir- 1 : 



i€/«}| 



xejo,!}"- 1 ieAf 



(1) 



= (2- 1 -l)+ I 



|{XG{0,1} 



n-l . 



: i€/(x)}| 



ieN\{n} 
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(a) The vector (1,0, 1,1,0,0) 



(b) The vector (1,1,0,0,0,1) 



Figure 1: Correspondence between stars and binary (n — l)-vectors, for n = 7 



where the second equality is obtained by separating |{x G {0, 1}" 1 : n G /(x)}| from the sum, and observing 
that it follows from the first two constraints that this expression equals 2" _1 — 1. Since 2" _1 — 1 is odd and 
LieN\{n} |{x G {0, l}" -1 : i G /(x)}| is even, © implies that I xe{0 ,i}»-i |/(x)| is odd. 
On the other hand, it trivially holds that 



It is interesting to note that if we slightly change the problem formulation by allowing the selection of 
at most k agents for k > 2 then it is possible to design a curious deterministic SP mechanism with a finite 
approximation ratio that selects at most two agents. The reader is referred to Appendix |B] for more details. 

4 Randomized Mechanisms 

In Section [3] we have established a total impossibility result with respect to deterministic SP ^-selection 
mechanisms. In this section we ask to what extent this result can be circumvented using randomization. 

4.1 SP Randomized Mechanisms 

As we move to the randomized setting, it immediately becomes apparent that Theorem |3.1| no longer applies. 
Indeed, a randomized SP ^-selection mechanism with a finite approximation ratio can be obtained by simply 
selecting k agents at random. However, this mechanism still yields a poor approximation ratio. Can we do 
better? 

Consider first a simple deterministic mechanism that partitions the agents into two predetermined subsets 
S\ and S2- Next, the mechanism discards all edges between pairs of agents in the same subset. Finally, 
the mechanism chooses the top k/2 agents from each subset. In other words, the mechanism selects the 
k/2 agents with highest indegree from each subset, where the indegree is calculated only on the basis of 
incoming edges from the other subset. This mechanism is clearly SP. Indeed, consider some i G S t , t G {1,2}; 
its outgoing edges to agents inside its subset are disregarded, whereas its outgoing edges to agents in S^-t 
can only influence which agents are selected from 53 _ f . However, even without Theorem 13. II it is easy to 
see that the mechanism does not yield a finite approximation ratio, since it might be the case that the only 
edges in the graph are between agents in the same subset. 

We leverage and refine the partition idea in order to design a randomized SP mechanism that yields a 
constant approximation ratio. More accurately, we define an infinite family of mechanisms, parameterized 



£ |/( X )|= £ k = r- l -k 



hence 



xe{0,i}"-' xejoj}"- 1 
Lxe{o l/( x )l i s even. We have reached a contradiction. 
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(a) The given graph 



(b) The partitioned graph 



Figure 2: Example for the Random 2-Partition Mechanism, with n = 6 and k = 2. Figure [2(a)] illustrates the given 
graph. The mechanism randomly partitions the agents into two subsets, shown in Figure [2(b)] and disregards the edges 
inside each group. The mechanism then selects the best agent in each group based on the incoming edges from the 
other group; in the example, the selected subset is {1,5}, with a sum of indegrees of four, whereas the optimal subset 
is {2,5}, with a sum of indegrees of five. 

by a parameter m G N. Given m, the mechanism randomly partitions the set of agents into m subsets, and 
then selects (roughly) the top k/m agents from each subset, based only on the incoming edges from agents 
in other subsets. Below we give a more formal specification of the mechanism; an example can be found in 
Figure |2] 

The Random ra-Partition Mechanism (m-RP) 

1. Assign each agent independently and uniformly at random to one of m subsets Si,...,S m . 

2. Let T C {1, . . . ,m} be a random subset of size k — m- [k/m\ . 

3. If t G T , select the \k/m\ agents from S t with highest indegrees based only on edges from N\S t . If 
t ^ T, select the \k/m\ agents from S t with highest indegrees based only on edges from N\S t . Break 
ties lexicographically in both cases. If one of the subsets S t is smaller than the number of agents to be 
selected from this subset, select the entire subset. 

4. If only k' <k agents were selected in Step 3, select k — k' additional agents uniformly from the set of 
agents that were not previously selected. 

Note that if k = 1 and m = 2 then we select one agent from one of the two subsets, based on the incoming 
edges from the other. In this case, step 2 is equivalent to a toss of a fair coin that determines from which of 
the two subsets we select an agent. 

As in the deterministic case, given a partition of the agents into subsets S\,...,S m , the choice of agents 
that are selected from S t is independent of their outgoing edges. Furthermore, the partition is independent 
of the input. Therefore, m-RP is spI The following theorem explicitly states the approximation guarantees 
provided by m-RP; the technical and rather delicate proof of the theorem is relegated to Appendix lAl 

Theorem 4.1. Let N = {1, . . . ,n}, k G {1 , . . . ,n — 1}. For every value ofm, m-RP is SP. Furthermore, 

1. 2-RP has an approximation ratio of four, and 

3 The mechanism is even universally SP, see Section[5] 
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2. ( |~fc 1//3 ~|)-/?P has an approximation ratio of 1 + &{\jk x l^\ 

In fact, we can choose the best value of m for any given value of k when we apply m-RP. In other 
words, Theorem 14. 1 1 implies that for every k there exists an SP mechanism with an approximation ratio of 
min{4, 1 + ^(l/fc 1 / 3 )}, that is, an approximation ratio that is bounded from above by four for any value of 
k, and approaches one as k grows. 

It follows from the theorem that, for k = 1, 2-RP has an approximation ratio of four; for this case m-RP 
with m > 2 has a strictly worse ratio. It is interesting to note that the analysis is tight. Indeed, consider a 
graph G = (N,E) with only one edge from agent 1 to agent n, that is, E = {(l,n)}. Assume without loss of 
generality that agent n is assigned to Si . In order for agent n to be selected, two events must occur: 

1. T = {1}, that is, the winner must be selected from Si. This happens with probability 1/2. 

2. Either 1 G S2, or |Si | = 1. The probability that 1 G S2 is 1/2. The probability that |Si| = 1, given that 
n G Si, is l/2"~ ! . By the union bound, the probability of this event is at most 1/2+ 1/2" -1 . 

It is clear that n cannot be selected unless the first event occurs. If the second event does not occur, it follows 
that n has an indegree of zero based on the incoming edges from S2, and there are other alternatives in Si 
(which also have an indegree of zero). Since tie-breaking is lexicographic, agent n would not be selected. 
As the two events are independent, the probability of both occurring is therefore at most 1/4 + 1/2". We 
conclude that the approximation ratio of the mechanism cannot be smaller than 

We next provide a very simple, though rather weak, lower bound for the approximation ratio yielded by 
randomized SP /c-selection mechanisms. Let k G {1, .. . ,n — 1}, and let / : — > A( I 5^) be a randomized SP 
^-selection mechanism. Consider the graph G = (N,E) where 

E = {(i,i+l) : i = l,...,Jfc}U{(Jt+l,l)}, 

i.e., E is a directed cycle on the agents 1, . . . ,k+ 1. Then there exists an agent i G {1, . . . ,&+l}, without loss 
of generality agent 1, that is included in f(G) with probability at most k/(k+ 1). Now, consider the graph 
G' where E' = E \ {(1,2)}, that is, agent 1 removes its outgoing edge to agent 2. By strategyproofness, 
agent 1 is included in f(G') with probability at most k/(k + 1). Any subset S G =5^ such that 1 ^ S has at 
most k — 1 incoming edges in G'. It follows that the expected number of incoming edges in f(G') is at most 

k 1 k 2 + k-\ 

k+ (k— 1) = — . 

fc + 1 jfe + 1 v ; jt + 1 

Hence the approximation ratio of / cannot be smaller than 

k _ 1 

W+k=T ~ l + B + k-Y () 

We have therefore proved the following easy result. 

Theorem 4.2. Let N = {1, . . . ,n}, n > 2, k E {1,.. . ,n — 1}. Then there is no randomized SP k-selection 
mechanism with an approximation ratio smaller than 1 + £2(1 /k 2 ). 
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Not surprisingly, the lower bound given by Theorem 14. 21 converges to one, albeit more quickly than the 
upper bound of Theorem 14.11 As usual, an especially interesting special case is when k = 1. Equation © 
gives an explicit lower bound of two for this case. On the other hand, Theorem 14. 1 1 gives an upper bound of 
four. We conjecture that the correct value is two. 

Conjecture 4.3. There exists a randomized SP l-selection mechanism with an approximation ratio of two. 

One deceptively promising avenue for proving the conjecture is designing an iterative version of the 
Random Partition Mechanism. Specifically, we start with an empty subset S C N, and at each step add 
to 5 an agent from N\S that has minimum indegree based on the incoming edges from S, breaking ties 
randomly (so, in the first step we would just add to S a random agent). The last agent that remains outside 
S is selected. This SP mechanism does remarkably well on some difficult instances, but fails spectacularly 
on a contrived counterexample. We give a formal specification of this Sliding Partition Mechanism, and 
construct the illuminating counterexample, in Appendix 

4.2 GSP Randomized Mechanisms 

In the beginning of Section 14.11 we identified a trivial randomized SP ^-selection mechanism, namely the 
one that selects a subset of k agents at random. Of course this mechanism is even GSP, since the outcome is 
completely independent of the reported graph. 

We claim that selecting a random ^-subset gives an approximation ratio of n/k. Indeed, consider an 
optimal subset K* C N with \K*\ =k. Each agent i € K* is included in the selected subset with probability 
k/n, and hence in expectation contributes a (fc/«)-fraction of its indegree to the expected total indegree of 
the selected subset. By linearity of expectation, the expected total indegree of the selected subset is at least 
a (k/n) -fraction of the total indegree of K*. 

Theorem 14. 1 1 implies that we can do much better if we just ask for strategyproofness. If one asks for 
group strategyproofness, on the other hand, just selecting a random subset turns out to be optimal up to a 
tiny gap. 

Theorem 4.4. Let N = { 1 , . . . , n}, n > 2, and let k G {1 , . . . ,n — 1 }. No randomized GSP k-selection mech- 
anism can yield an approximation ratio smaller than (n — 1 ) jk. 

Proof. Let / : — > S^k be a randomized GSP mechanism. Given the empty graph, there are two agents 
i,j £ N such that each is selected with probability at most k/(n — 1). 

Consider the graph G' where E' = {(i,j), C/,i)}> tnat * s > there are only two edges in G', from i to j and 
from j to i. By group strategyproofness, it must hold for either i or j that f(G') selects this agent with 
probability not greater than under the empty graph; we may assume without loss of generality that f(G') 
selects i with probability at most kj(n — 1). 

Now consider the graph G" with E" = {(j,/)}. By strategyproofness, i is selected with equal probability 
under f(G') and f(G"), that is, with probability at most k/(n — 1). Since i is the only agent with an incoming 
edge in G", the approximation ratio is at least (n — \)/k. □ 

Note that Theorem 14.41 holds even if one is merely interested in coalitions of size at most two. 

5 Discussion 

In this section we discuss several prominent issues, and list some open problems. 
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Payments. If payments are allowed and the preferences of the agents are quasi-linear then truthful imple- 
mentation of the optimal solution is straightforward: simply give one unit of payment to each agent that is 
not selected. This can be refined by only paying "pivotal" agents that are not selected, that is, agents that 
would have been selected had they lied. However, even under the latter scheme we may have to pay all the 
non-selected agents (e.g., when the graph is a clique). Moreover, a simple argument shows that there is no 
truthful payment scheme that does better. 

The utility model. We have studied an "extreme" utility model, where an agent is only interested in the 
question of its own selection. The restriction of the preferences of the agents allows us to circumvent 
impossibility results that hold with respect to more general preferences, e.g., the Gibbard-Satterthwaite 
Theorem Ifl6ll23l and its generalization to randomized rules ifTTl . 

It is possible to consider a more sensitive utility function, where an agent receives a utility of one if it is 
selected, plus a utility of /3 > for each of its (outgoing) neighbors that is selected. In this model the social 
welfare (sum of utilities) of a set S of selected agents is k plus j8 times the total indegree of S. Hence, if 
j6 > 0, a set S maximizes the social welfare if and only if it maximizes the total indegree. In particular, if 
P > and payments are available, we can use the VCG mechanism E51 [U [191 (see [[20 ] for an overview) to 
maximize the total indegree in a truthful way. 

It is easy to verify that any upper bound in the 0-1 model (with total indegree as the target function) also 
holds in the j8-l model (with social welfare as the target function), hence Theorem 14. H is true for the latter. 
Furthermore, if not zero, j3 may still be very small in many settings, like those described in Section [T] In 
such cases a variation on the random partition mechanism achieves an approximation ratio close to one for 
the social welfare, even when k = I. Finally, note that if j8 > 1 then simply selecting the optimal solution 
(and breaking ties lexicographically) is SP. 

Weights and an application to conference reviews. A seemingly natural generalization of our model can 
be obtained by allowing weighted edges. Interestingly, our main positive result, namely Theorem 14. II also 
holds in this more general setting (subject to minor modifications to its formulation and proof). However, 
closer scrutiny reveals that it is our target function that is often meaningless in the weighted setting. Indeed, 
the absence of an edge between i and j would in this context imply that i has no information about j, 
whereas an edge with small weight would imply that i dislikes or distrusts j. Therefore, maximizing the 
sum of weights on incoming edges may not be desirable. 

That said, in very specific situations maximizing the sum of weights on incoming edges makes perfect 
sense; one prominent example is conference reviews. In this context the reviewers assign scores to papers 
while often submitting a paper of their own, and a subset of papers must be selected. This setting is special 
since it is usually the case that each paper is reviewed by three reviewers, i.e., each agent has exactly three 
incoming weighted edges, hence maximizing the sum of scores is the same as maximizing the average score. 
We conclude that m-RP can be employed to build a truthful conference program! 

Universal strategyproofness vs. strategyproofness in expectation. In the context of randomized mecha- 
nisms, two flavors of strategyproofness are usually considered. A mechanism is universally SP if for every 
fixed outcome of the random choices made by the mechanism an agent cannot gain by lying, that is, the 
mechanism is a distribution over SP mechanisms. A mechanism is SP in expectation if an agent cannot 
increase its expected utility by lying. In this paper we have used the latter definition, which clearly is the 
weaker of the two. On the one hand, this strengthens the randomized SP lower bound of Theorem 14.21 On 
the other hand, notice that the randomized mechanisms of Section |4] are in fact universally SP. Indeed, for 
every fixed partition, selecting agents from one subset based on incoming edges from other subsets is SP. 
Hence, Theorem 14. II is even stronger than originally stated. 
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Open problems. Our most enigmatic open problem is the gap for randomized SP 1 -selection mechanisms: 
Theorem 14. 1 1 gives an upper bound of four, while Theorem 14.21 gives a lower bound of two. We conjecture 
that there exists a randomized SP 1 -selection mechanism that gives a 2-approximation. 

In addition, a potentially interesting variation of our problem can be obtained by changing the target 
function. One attractive option is to maximize the minimum indegree in the selected subset. Clearly, our 
total impossibility for deterministic SP mechanisms (Theorem 13.11 ) carries over to this new target function. 
However, it is unclear what can be achieved using randomized SP mechanisms. 
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A Proof of Theorem 4.1 



For the first part of the theorem, consider an optimal set of k agents (which might not be unique), and denote 
it by K* C N. Let OPT be the sum of the indegrees of the agents in K* , that is, 

OPT= £ deg(i). 

ieK* 



We wish to show that the mechanism selects a ^-subset with an expected number of OPT/4 incoming edges. 
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Figure 3: An illustration of the proof of Theorem l4.ll for n = 8 and k = 4. In the given graph G, the optimal subset is 
K* = {1,2,3,4}. N is partitioned into S\ = {1,2,5,6} and S 2 = {3,4,7,8}, which partitions K* into A? = {1,2} and 
K* = {3,4}. We have that d x =d 2 = \. 

Consider some partition n of the agents into two subsets Si and S 2 - In particular, let K* be partitioned 
into K\ C 5i and K\ C 52, and assume without loss of generality that \K\\ > \K 2 \- Denote by d\ the number 
of edges from S 2 to K*, that is, 

d 1 = \{(i,j)eE: ieS 2 AjeKt}\, 

and similarly 

d 2 = \{(i,j)€E: i€SiAj€K%}\. 

See Figure [3] for an illustration. 

Note that step 2 of the 2-RP mechanism is equivalent to flipping a fair coin to determine whether we 
select \k/2] agents from Si and [k/2\ agents from S2 (when T = {1}), or vice versa (when T = {2}). 
Now, since < L^/^J (by our assumption that | > it follows that the subset of S2 selected by 

the mechanism has at least d 2 incoming edges, regardless of whether r = {l}orT = {2}, and even if 
1^2 1 < L^/^J- Moreover, since \Kl \ < \K*\ = kit holds that the subset of S\ selected by the mechanism has 
at least (\k/2~\/k) -di incoming edges if T = { 1}, and at least ( [k/2\ /k)-d\ if T = {2}. Therefore, we have 
that 

E [ MECH I K ] =E[MECH| K A T = {1} ] • - + E [ MECH | % AT = {!}]■- 
= y + ^2>^. 

For a random partition of the agents into Si and 52, each edge has probability 1/2 of being an edge 
between the two subsets, and probability 1/2 of being inside one of the subsets. Hence, by linearity of 
expectation, the expected number of edges incoming to K* that are between the two subsets is OPT/2. 
Formally, for a partition 71, let 5^ and 5J be the two subsets of agents, and let 

d* = g e : (iesfAje s^nK*) v (/ es$Aje sfnK*)}\. 

Then it holds that 

OPT 

£Pr[7rM* = ^-i (4) 

7t Z 
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We can now conclude that 

d % OPT 

E[MECH] =£E[MECH | n]-Pr[n] > £Pr[ % } ■ — = ——, 

where the second transition follows from (O and the third transition follows from ©. 

We now turn to the second part of the theorem. For ease of exposition, we will omit the various floors 
and ceilings from the proof, as we are looking for an asymptotic result. We employ one additional idea: if k 
is large enough, the random partition into fc 1 ' 3 subsets will be relatively balanced. A direct approach would 
be to bound the probability that the number of optimal agents in some subset deviates significantly from 
k 2 / 3 , and then proceed in a way similar to the first part. We however take a somewhat different approach 
that yields a better result. 

Consider the agents in the optimal set K*, and assume without loss of generality that K* = {1, . .. ,k}. 
Given i £ K* , we define a random variable Z, that depends on the random partition of N to Si,... ,S k i/3 as 
follows: 

Zi = \{jeK*\ {/} : 3t s.t. ieS t Aje S t }\, 

that is, Z, is the number of agents in the optimal set, excluding i itself, that are in the same random subset as 
agent i. We have 

E[MECH]= £ E[MECH\Z 1 =s 1 ,...,Z k = s k ]-Pr[Z 1 =s 1 ,...,Z k = s k ], (5) 

where the probability is taken over random partitions. 

Recall that the k l / 3 -RP Mechanism selects the top k 2 l 3 agents from each subset. Let 

a s = min{l,A: 2 / 3 /(^+l)}- 
Furthermore, given i € K* and a partition, let 

d\ = EE ; j € S h A i e S tl A?i / t 2 }\, 

i.e., d- is the number of edges incoming to agent i from other subsets. Using similar arguments to those 
employed to obtain ([3]), we get 



E [ MECH \Z 1 =s u ...,Z k = s k ]>E 



d 'i G H I Z x =Sl,...,Z k =S k 



ieK* 



£ E [ d[a Si | Zi = si, . . . ,Z k = s k ] . 



(6) 



ieK* 

We wish to obtain an explicit expression for E [ d[a s . \ Z\= s\,. . . ,Z k = s k \. For i £ ,/V and S C N, let 

deg(i,S) = \{U,i)£E: j £ S}\ 
be the indegree of agent i based on incoming edges from agents in S. We claim that 

E[d' i a Si \Zi=s u ...,Z k = s k ] = ( ^~]_~ Si ■ degfcg*) + ■ deg(t, N\K* ) j a sr (7) 

Indeed, this identity is obtained by using linearity of expectation twice, as any fixed agent in K* is not in the 
same subset as agent i with probability (k— 1 — Si)/(k— 1), and any fixed agent inN\K* is not in the same 
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subset as agent i with probability (fc 1 / 3 — l)/^ 1 / 3 . Notice that the expression on the right hand side of f7]) is 
independent of sj for all j ^ i. 

Combining ([5]>, ©, and (0), and reversing the order of summation, we conclude that 

E[MECH]>£ £ VT[Z l =sx,...,Z k = s k \- 

i£K* Si,...,s k 

(k — 1 — ? t J /3 _ 1 \ 

^-^•deg(/,^) + -^.deg(/^\r)j -a, 

£ £ Pr [Z, = , ] • ( 1-^- • deg(/,^) + • degfttf ] • a,. 



,v=0 



£ £ Pr [Z, = , ] • • deg(/,r ) • a s + 



ieK' s =0 



k-1 fr!/3_1 

I EMZ» = *]'^ /j— deg(i,iV\jr)-a,. 



i€K*s=0 K 



On the other hand, we have that 



OPT= £ (deg(i,K*)+deg(i,N\K*)) = £ deg(i,Z*) + £ deg(i,N\K*). 

ieK* ieK* ieK* 

In order to complete the proof it therefore suffices to prove that for every ieK*, 

: fT, = - ^ 



S=0 

and 



£Pr[Z ! =,].^ T73 -.a v = l-^(^j, (8) 



tMZ i = s] k - T ^.o s = X-^(^\. (9) 



Using these equalities we may conclude that 

OPT £ ier degftAr*)+£, eir .deg(;,M\Ar*) 



1 MECH 1 Lap (l-<?(p ! n))deg(/,S') + D«. (l -^(jfc))deg(i,A'\K-) 



1 i + e ±-\ 



Since a s = 1 for all s < & 2 / 3 — 1, in order to establish (HJ we must show that 



Indeed, 



£ Pr [Z,- = s]-^— < £ Pr Z>k 2 / 3 + (x-\)k 1 / 3 



jfe2/3 +^V3 + 1 



+ Pr 



Zi>0 3 +2^/]ogk-k 1/3 
15 



• 1. 



(10) 



In order to bound the probabilities on the right hand side of (fTOb we employ the following version of the 
Chernoff bounds (see, e.g., 0, Theorem A. 1.11). 

Lemma A. 1. LetX\, ...,Xk be i.i.d. Bernoulli trials, Pr[Xj = 1 ] = pfori= l,...,k, and denote X = Y!i=i^i- 
In addition, let X > 0. Then 

( X 2 X 3 
Pr[X-kp>l]<exv (-— + 



2kp 2(kp) 2 J ' 

Z; is in fact the sum of k — 1 i.i.d. Bernoulli trials, but we can safely assume that it is the sum of k trials 
if we are interested in an upper bound on the probability of the sum being greater than some given value. 
Using Lemma lATTI with X = xk l l 3 and p = \/k l l 3 we get 

{x-ir 



Pr 



Zj>k 2/3 + (x-l)k 



1/3 



< 



exp 



x- l) 2 k 2 / 3 (x- l) 3 k 



2fc 2 /3 2£ 4 / 3 

where the second inequality holds for a large enough k. Similarly, 

Pr Zi > k 2/3 + 2 x/logfc • k l/3 } < exp 



< 



40 3 logk + Sk(logk) 3 ^ 



exp 



<exp(-log&) < -. 



(11) 



2£ 2 / 3 2£ 4 / 3 

We conclude that the expression on the right hand side of ( fTOb is bounded from above by 
U-l) 2 \ x&P + l 



2v/EP / 



\ 11 ( {x-\) 2 \ „ \ 1 

^/3 + ^/3 + l) + ^F73 £ (exp(-W^-2*)+. 



which follows from the fact that the series Y^c=\ exp(— @(x 2 )) • &(x) converges. This establishes ([8]). 
The proof of Q is similar to that of ([8]). It is sufficient to show that 



/t 2 / 3 +2 v /lop-/fe'/ 3 -l 



s=0 



k-l 



+ 



+ Pr 



I 

S=jfc 2 / 3 



Pi[Zi = s] ^1 
Z t >k 2/3 + 2^/logk-k 1/3 



k-l-s k 2 / 3 



It holds that 

and as before, 
Finally, 



A+ 3 -l _ * 2 / 3 -l P/3_1 

£ Pr[2i = 5]. r i T < £ Pr[Z = ^ 



fc-1 s+l 
.k^)- 



1 



.5=0 



5=0 



Pr 



Zi >k 2/3 + 2 v / iogk-k 



1/3 



£-1 



• 1 < 



/t 2 / 3 +2 v /IogI-<: 1 / 3 -l 



I Pr[Z, = ,]ll 

i=i 2 / 3 V 

fc 2 '' 3 +2 v /EP-/t 1 / 3 -l 

£ Pr[Z = ,]|l-|i 



>k- 1 -5 k 2 l 3 



S=ft2/3 



fc-1 5+1 



1 

FT 3 



fc 2 / 3 ' 

5+1 



We can thus bound this sum from above as before using (fTTT ). This completes the proof of Theorem 14. II 
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Figure 4: Example for the Edge Scan Mechanism. Given this graph, the mechanism would select agent 4 in the scan 
from left to right, and agent 3 in the scan from right to left, so the subset of agents selected by the mechanism is {3,4}. 

B The Edge Scan Mechanism 

In Theorem 13. II we have seen that a deterministic SP ^-selection mechanism cannot give a bounded approx- 
imation ratio. We now show that if we are allowed to choose at most k agents, for k > 2, it is possible to 
design an SP mechanism with a bounded approximation ratio. As noted in Section [3l it is sufficient to select 
a subset with an incoming edge, if one exists. 

Intuitively, the mechanism, which we refer to as the Edge Scan Mechanism, first orders the agents from 
left to right according to their lexicographic ordering. The mechanism then scans the agents from left to 
right, until it finds an outgoing edge directed to the right, and selects the agent the edge is pointing at. 
Similarly, the mechanism scans the agents from right to left until it finds an edge that is directed to the left, 
and also selects the agent that this edge is pointing at. An example is shown in Figure [4] What follows is a 
more formal specification of the mechanism. 

The Edge Scan Mechanism. 

1. Partition £ into E\ = G E : i < j} and £2 = {(hj) G E : i > j}. 

2. If E\ 7^ 0, let i £ N be the minimum index such that there exists j £ N with G E\; add to the 
subset the minimum j such that G E\. Otherwise, add agent n to the subset. 

3. If £2 7^ ®> let i G N be the maximum index such that there exists j £ N with G £2; add to the 
subset the maximum j such that (i,j) G £2. Otherwise, add agent 1 to the subset. 

The Edge Scan Mechanism is clearly SP. Indeed, agent i cannot benefit from adding outgoing edges, 
since these edges would only point at some other agent. It also cannot benefit from removing outgoing 
edges. Informally, if the mechanism reaches the point in the scan (from left to right or right to left) where 
the agent's vote is taken into account, then it is too late for agent i itself to be elected. 

Moreover, either E\ or £2 will contain an edge given that there is at least one edge in the graph, and 
the Edge Scan Mechanism is guaranteed to select an agent with an incoming edge in this case. It therefore 
achieves a finite approximation ratio, although this ratio can be as bad as Q.{nk). 

Crucially, the agents selected in both steps of the mechanism can be one and the same; in this case the 
mechanism would return a singleton subset. A curious implication of Theorem 13. II is that such a selection 
cannot be completed deterministically and in a strategyproof way to obtain a subset of size two. 

C The Sliding Partition Mechanism 

In this appendix we discuss the Sliding Partition Mechanism, informally presented in Section @] The mech- 
anism is randomized, and was designed to yield an SP upper bound better than four for the k = 1 case. 
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Although the mechanism ultimately fails in achieving this goal, we believe that the counterexample is sur- 
prising and may prove helpful in future attempts to resolve Conjecture 14.31 We start with an informal 
specification of the mechanism. 

The Sliding Partition Mechanism. 

1. Let S = 0. 

2. While \S\ < n — 1, choose i ^ 5 that has minimum indegree based on edges from agents in S, breaking 
ties randomly. Let S = S U {/}. 

3. Select the agent in N \ {S}. 

When an agent is added to 5, we say that it is eliminated. It is easy to see that this mechanism is SP. 
Indeed, only the outgoing edges of eliminated agents are taken into account at any stage. Once an agent 
is eliminated, it no longer has a chance to be selected, therefore it is indifferent to the outcome of the 
mechanism. 

Another interesting observation is that the Sliding Partition Mechanism gives a 2-approximation for the 
example where the analysis of the 2-RP mechanism is tight: a graph with only one edge. Indeed, if G has 
one edge (i,j), then j is certainly elected once i is eliminated (since then it is the only agent in N\S with an 
incoming edge from S), and i is eliminated before j with probability 1/2. 

Unfortunately, it is possible to construct a graph where the mechanism does very poorly. For this, 
consider a tree with agent 1 at the root. There is a set T C of size « 3 / 5 of agents with outgoing edges to 
1, that is, deg(l) = n 3 ^ 5 . In addition, each agent in T has n 2 / 5 incoming edges from agents in ({1} U T). 
The agents in./V\({l}ur) have an indegree of zero. 

Notice that while there are agents in N\S that have no incoming edges from S, the mechanism selects 
one of these agents uniformly at random and eliminates it. Consider the first point in time to when all 
the agents in T \ S that were not yet eliminated have at least one incoming edge from S; we can assume 
without loss of generality that at this point agent 1 has not been eliminated. We claim that if less than « 2 / 5 
agents from T have been eliminated at time to, then agent 1 is guaranteed to be eliminated later on. Indeed, 
stalling at to, the remaining agents inAf\({l}ur) are eliminated one after the other (in some random 
order), because all of them have indegree zero from S, while all other remaining agents have at least one 
incoming edge from S. After all the agents in N\ ({1} U T) have been eliminated, each agent in T \ S has 
7i 2 / 5 incoming edges from S. By assumption agent 1 has less and is eliminated next. 

We now claim that with high probability, agent 1 has less than n 1 ^ incoming edges from S at time 
to- Each agent i € T contributes an edge to 1 at time to if and only if it is eliminated before any of the 
agents in its incoming neighborhood; this happens with probability roughly l/n 2 / 5 . Therefore, by linearity 
of expectation, the expected number of edges from S to agent 1 at time to is roughly n l l 5 . The claim now 
follows directly from Chernoff 's inequality. 

We conclude that the approximation ratio provided by the Sliding Partition Mechanism cannot be smaller 
than Q.(n l l 5 ). By optimizing the parameters of the example, it is possible to obtain an even stronger lower 
bound. 
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